Bootstrapping cluster analysis: assessing the reliability of conclusions from microarray experiments.
Abstract
We introduce a general technique for making statistical inference from clustering tools applied to gene expression microarray data. The approach utilizes an analysis of variance model to achieve normalization and estimate differential expression of genes across multiple conditions. Statistical inference is based on the application of a randomization technique, bootstrapping. Bootstrapping has previously been used to obtain confidence intervals for estimates of differential expression for individual genes. Here we apply bootstrapping to assess the stability of results from a cluster analysis. We illustrate the technique with a publicly available data set and draw conclusions about the reliability of clustering results in light of variation in the data. The bootstrapping procedure relies on experimental replication. We discuss the implications of replication and good design in microarray experiments.
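The idea of bootstrapping a cluster analysis can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several assumptions: the per-gene, per-condition mean over replicates stands in for the analysis of variance model, residuals are pooled and resampled with replacement, and "clustering" is reduced to assigning each gene to the best-correlated of a few fixed reference profiles. The function names `assign_to_profiles` and `bootstrap_cluster_stability`, the array layout (genes x conditions x replicates), and all parameters are hypothetical.

```python
import numpy as np


def assign_to_profiles(estimates, profiles):
    """Assign each gene (row) to the reference profile with the highest
    Pearson correlation. Assumes no row is exactly constant."""
    e = estimates - estimates.mean(axis=1, keepdims=True)
    p = profiles - profiles.mean(axis=1, keepdims=True)
    e_norm = np.linalg.norm(e, axis=1, keepdims=True)
    p_norm = np.linalg.norm(p, axis=1, keepdims=True)
    corr = (e @ p.T) / (e_norm * p_norm.T)
    return corr.argmax(axis=1)


def bootstrap_cluster_stability(data, profiles, n_boot=200, seed=0):
    """Residual bootstrap of cluster assignments.

    data     : array of shape (genes, conditions, replicates)
    profiles : array of shape (clusters, conditions)
    Returns the original assignment of each gene and the fraction of
    bootstrap data sets in which the gene received the same assignment.
    """
    rng = np.random.default_rng(seed)

    # Simplifying assumption: the fitted value is the mean over replicates,
    # standing in for the estimates of a full ANOVA model.
    fitted = data.mean(axis=2, keepdims=True)
    residuals = (data - fitted).ravel()

    original = assign_to_profiles(fitted[..., 0], profiles)
    matches = np.zeros(data.shape[0])

    for _ in range(n_boot):
        # Build a bootstrap data set: fitted values plus resampled residuals.
        resampled = rng.choice(residuals, size=data.shape, replace=True)
        boot_estimates = (fitted + resampled).mean(axis=2)
        matches += assign_to_profiles(boot_estimates, profiles) == original

    return original, matches / n_boot


if __name__ == "__main__":
    # Synthetic data for illustration only: 3 genes, 4 conditions, 4 replicates.
    demo_rng = np.random.default_rng(1)
    true_effects = np.array([[0, 1, 2, 3], [3, 2, 1, 0], [0, 0.05, 0, 0.05]])
    data = true_effects[:, :, None] + demo_rng.normal(0, 0.3, size=(3, 4, 4))
    profiles = np.array([[0.0, 1.0, 2.0, 3.0], [3.0, 2.0, 1.0, 0.0]])
    assignment, stability = bootstrap_cluster_stability(data, profiles)
    print(assignment, stability)
```

A gene whose assignment is reproduced in nearly every bootstrap data set can be regarded as stably clustered; a gene with a weak signal relative to the noise will tend to switch clusters between data sets. Because the residuals come from replicate measurements, the sketch cannot run without replication, which mirrors the abstract's point about experimental design.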
Similar resources
Randomized maps for assessing the reliability of patients clusters in DNA microarray data analyses
OBJECTIVE: Clustering algorithms may be applied to the analysis of DNA microarray data to identify novel subgroups that may lead to new taxonomies of diseases defined at bio-molecular level. A major problem related to the identification of biologically meaningful clusters is the assessment of their reliability, since clustering algorithms may find clusters even if no structure is present. METH...
MAANOVA: A Software Package for the Analysis of Spotted cDNA Microarray Experiments
We describe a software package called MAANOVA, for MicroArray ANalysis Of VAriance. MAANOVA is a collection of functions for statistical analysis of gene expression data from two-color cDNA microarray experiments. It is available in both the Matlab and R programming environments and can be run on any platform that supports these packages. MAANOVA allows the user to assess data quali...
Mixture modelling of gene expression data from microarray experiments
MOTIVATION: Hierarchical clustering is one of the major analytical tools for gene expression data from microarray experiments. A major problem in the interpretation of the output from these procedures is assessing the reliability of the clustering results. We address this issue by developing a mixture model-based approach for the analysis of microarray data. Within this framework, we present nov...
Assessing Gene Expression Measurements: ML and Bayesian Techniques
Gene array studies enable assessment of expression patterns of thousands of genes over time and under multiple conditions. The analysis of these patterns requires detecting whether observed differences in expression levels are significant or not. To perform the analysis, we must first normalize the data. Normalization is the term used to describe the process of removing differences of measureme...
Molecular Characterization and Phylogeny Analysis Based on Sequences of Cytochrome Oxidase gene From Hemiscorpius lepturus of Iran
Abstract: Background: Hemiscorpius lepturus is a medically important scorpion found along the Iranian borders, especially near to Khuzestan Province in the south-west of Iran. This is the only non-buthid scorpion which is potentially lethal in southern Iran and is responsible for severe dermonecrotic scorpionism. OBJECTIVES: In this study, DNA fragment of the mitochondrial cytochrome c oxidase ...
Journal title:
- Proceedings of the National Academy of Sciences of the United States of America
Volume 98, Issue 16
Pages: -
Publication date: 2001